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The affinity-mediated selection of large libraries of DNA-encoded small molecules is increasingly being used to 
initiate drug discovery programs. We present universal methods for the encoding of such libraries using the chemical 
ligation of oligonucleotides. These methods may be used to record the chemical history of individual library members 
during combinatorial synthesis processes. We demonstrate three different chemical ligation methods as examples of 
information recording processes (writing) for such libraries and two different cDNA-generation methods as examples 
of information retrieval processes (reading) from such libraries. The example writing methods include uncatalyzed and 
Cu(l)-catalyzed alkyne-azide cycloadditions and a novel photochemical thymidine-psoralen cycloaddition. The first 
reading method "relay primer-dependent bypass" utilizes a relay primer that hybridizes across a chemical ligation junc- 
tion embedded in a fixed-sequence and is extended at its 3'-terminus prior to ligation to adjacent oligonucleotides. The 
second reading method "repeat-dependent bypass" utilizes chemical ligation junctions that are flanked by repeated 
sequences. The upstream repeat is copied prior to a rearrangement event during which the 3'-terminus of the cDNA 
hybridizes to the downstream repeat and polymerization continues. In principle these reading methods may be used 
with any ligation chemistry and offer universal strategies for the encoding (writing) and interpretation (reading) of DNA- 
encoded chemical libraries. 



Introduction 

The use of DNA-encoded chemical libraries permits the 
simultaneous interrogation of large numbers of small-molecules 
using affinity-mediated selection methodologies. 1 " 11 Such librar- 
ies have recently been reported to have yielded potent small-mol- 
ecule inhibitors for a range of therapeutic target proteins 10,12 " 18 
including for challenging targets such as the inhibition of pro- 
tein-protein interactions. 19 " 22 During the chemical synthesis of 
DNA-encoded libraries of small-molecules the chemical his- 
tory of individual library members is either recorded using the 
sequences of oligonucleotide tags that are ligated to the library 
member within each of multiple chemistry-specific compart- 
ments or directed using a pre-existing oligonucleotide template. 
During DNA-recorded chemical library generation successive 
cycles of split, encoding by tagging, chemistry and pool generate 
large combinatorial mixtures of different small-molecules, the 
identities of each of which may be inferred by determining the 
sequence of the attached concatenated oligonucleotide tag set. 
A range of oligonucleotide tagging strategies has been reported 



to be suitable for the generation of DNA-recorded libraries 
including solid-phase coupled peptide and oligonucleotide syn- 
thesis, 1,23 the enzymatic ligation of single-stranded oligonucle- 
otides, 2 the enzymatic extension of hybridized single-stranded 
oligonucleotides, 24 the enzymatic ligation of double-stranded 
oligonucleotides 10 and the chemical ligation of single-stranded 
oligonucleotides. 25 Related strategies have been reported for the 
generation of DNA-directed libraries using template-directed 
chemical synthesis on a pre-existing template 3,5,11 or hybridiza- 
tion-dependent routing of pre-existing templates to chemistry- 
specific compartments. 6 " 8 

Many of these oligonucleotide tagging schemes suffer from 
significant limitations. Hybridization-mediated strategies are 
limited to small split sizes because large differences need to exist 
between the sequences of all members of the encoding oligo- 
nucleotide set in order to reduce mis-hybridization rates. And 
some of these strategies require the covalent conjugation of each 
chemical building-block to a different oligonucleotide tag which 
will become laborious for large sets of chemical building blocks. 
Also, the use of enzymes to effect ligation or extension requires 
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Figure 1. Mechanism of cDNA generation from chemically ligated tags using relay primer-mediated bypass on a photochemically ligated non-travers- 
able template. A terminal primer is annealed to the 3'-terminus of the chemically ligated oligonucleotide and a second 5'-monophosphorylated primer 
(relay primer) is annealed across the chemical ligation junction. Extension of both primers and ligation generates full-length cDNA including potentially 
encoding sequences from both upstream and downstream of the photochemical ligation junction. 



the alternate establishment of chemistry-compatible and enzyme- 
compatible conditions often within large numbers of low-volume 
compartments and can suffer from the inhibition of enzyme 
activity by residual chemistry components. 

The chemical ligation of oligonucleotides is an attractive alter- 
native encoding strategy. Chemical reactions are generally more 
versatile than enzymatic reactions and in many instances may be 
performed under a range of solution conditions which could obvi- 
ate the need for buffer exchange within thousands of low-volume 
compartments - as has been reported for some enzymatic tagging 
strategies. 10 One of the more significant limitations of chemical 
ligation-mediated encoding schemes is the apparent need to gen- 
erate a chemical linkage between two oligonucleotides that may 
be traversed by a template-dependent polymerase in order that tag 
identities and associations may be determined by sequencing. In 
this paper we demonstrate two different cDNA-generation (read- 
ing) strategies for chemically ligated oligonucleotides — neither of 
which require that a polymerase be able to translocate through 
the chemical ligation junction. We also demonstrate three differ- 
ent example chemical ligation strategies that may be read by these 
processes. In principle both of these cDNA-generation strategies 
permit the use of any chemical bond-forming reaction to support 
an oligonucleotide tagging strategy for DNA-encoded chemical 
library generation. 

Strategies for generating chemically ligated oligonucleotides 
that polymerases may translocate through have also been reported 
and also offer alternative approaches for the synthesis of DNA- 
encoded chemical libraries. Examples include the use of cyano- 
gen bromide or water-soluble carbodiimides to generate native 
phosphodiester linkages from 5'-monophospho and 3'-hydroxyl 
oligonucleotides, 26,27 the use of 5'Todo and 3'-Phosphorothio 



Table 1. Yields of psoralen-mediated oligonucleotide-templated photo- 



chemical oligonucleotide ligations under different solution conditions 



Buffer 


Psoralen-thymidine ligation conversion 


pH 5.5 Phosphate 


65% 


pH 7.0 Phosphate 


88% 


pH 9.4 Borate 


80% 



oligonucleotides to generate phosphorothiodiester linkages, 28 
the use of Cu (I) -catalyzed triazole synthesis with 5'-propargyl 
and 3'-azido oligonucleotides 25 and the use of Cu (I) -catalyzed 
triazole synthesis with 5'-azido and 3'-propargyl oligonucle- 
otides. 25,30 The utility of such approaches for the generation of 
DNA-encoded libraries of small-molecules is dependent upon the 
extent to which tag polymerization can be avoided by for example 
by making chemical ligations template-dependent, or the devel- 
opment of a suitable protection strategy, or the use of pairs of 
orthogonal chemistries. Challenges to these approaches are likely 
to be experienced with regard to yield, removal of unreacted tags 
and interference with the reaction scheme used to generate the 
encoded chemical entity, but we believe that these can be over- 
come and we are also exploring such approaches. 25 In this paper 
we report on strategies for the generation of soluble unimolecular 
oligonucleotide-encoded information (writing) and the deter- 
mination of this information (reading) using chemical linkages 
between oligonucleotides that are not traversable by polymerases. 
We present three chemical ligation methods and two cDNA- 
generation methods by which this may be achieved and support 
each with experimental demonstration of both the writing and 
the reading processes. 
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Generation of cDNA from a chemically ligated 
conjugate using relay primer-dependent bypass 

Relay-primer dependent bypass utilizes a "relay 
primer" and coupled template-dependent polymer- 
ization and ligation as a means to generate contigu- 
ous cDNA sequences that can in turn be amplified, 
cloned and sequenced. In this approach both ter- 
mini of the concatenated tag set contain sufficient 
fixed sequence for the purposes of primer-binding 
and each chemical ligation junction is flanked on 
either side by sufficient fixed sequence for the pur- 
poses of relay primer-binding. A relay primer is our 
term for an oligonucleotide that hybridizes to a pair 
of chemically ligated tags in a manner that bridges 
their ligation junction. In the presence of a non- 
strand displacing template-dependent polymerase 
and annealed 5'-terminal and 5'-phosphorylated 
relay primers each primer is extended up to the 
5 '-terminus of the next. The addition of a ligase then 
seals the nick between these and generates a con- 
tiguous cDNA that may then be amplified, cloned 
and sequenced as part of the reading process. This 
process is shown in Figure 1. With multiple relay 
primers the sequences of multiple concatenated tags 
may be copied into a single cDNA. We refer to this 
method as relay primer-dependent bypass and note 
it's similarity to the manner in which Okazaki frag- 
ments are generated on the lagging template strand 
and then ligated together during DNA replication. 31 

In order to demonstrate relay primer-dependent 
bypass we used a psoralen-mediated photochemical 
ligation strategy to generate a cyclobutane adduct 
between an adjacent 3 '-thymidine and a 5 '-psoralen 
in a template-dependent reaction using a splint oli- 
gonucleotide to co-locate the photo-reactive oli- 
gonucleotide termini as shown in Figure 2A and 
B. The stereochemistry of the thymidine-psoralen 
adduct may be either cis or trans. 32 The linkage 
thus formed lacks a contiguous sugar-phosphate 
backbone and does not resemble a wild-type internucleotide 
linkage. This ligation is efficient as indicated in Figure 2B with 
conversions exceeding 90% and the conjugate of (1) and (4) 
was synthesized with a yield exceeding 80% and an observed 
deconvoluted MW of 15,819 Da (calculated MW 15,813 Da). 
Photochemistry of this kind is orthogonal to most other chemi- 
cal reactions and may be performed under a range of solution 
conditions - we have determined that it occurs with high yield at 
pH values as low as 5.5 and as high as 9.4 as shown in Table 1. 

Oligonucleotides modified with psoralen are easily obtained 
commercially and photochemistry is readily applied to multi- 
well plates containing many low-volume samples. Accordingly, 
a photochemical ligation strategy such as this appears to be 
well-suited for the generation of DNA-recorded chemical librar- 
ies so long as tag identities and associations can be determined 
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Figure 2. Psoralen-mediated photochemical oligonucleotide ligation. (A) The photo- 
chemical ligation of a 3'-thymidine oligonucleotide to a 5'-psoralen oligonucleotide 
after co-annealing to a complementary splint oligonucleotide (not shown). The stereo- 
chemistry of the thymidine-psoralen adduct may be either cis or trans. (B) Denaturing 
PAGE analysis of the time-course of the photochemical reaction that results in the liga- 
tion of a 3 '-thymidine oligonucleotide to a 5'-psoralen oligonucleotide with illumination 
at 365 nm, the gel is imaged by UV shadowing of a fluorescent plate. The appearance 
of the photochemical ligation product is indicated with a conversion of 50% occurring 
between two and four hours of illumination. 



by sequencing. The first step in the process of determining the 
sequence of photoligated concatenated psoralen oligonucle- 
otide tags is to generate a contiguous cDNA from which the 
sequences and associations of the photoligated tags can be 
deduced. To facilitate the subsequent separation of the cDNA 
from the photoligated template for analysis purposes the 5 '-ter- 
minus of the template was biotinylated, and to facilitate detec- 
tion of the cDNA the 5'-terminus of the terminal primer was 
fluoresceinylated. The photoligated conjugate was hybridized 
to the 5 '-terminal primer and the 5'-monophosphorylated relay 
primer. This hybridized complex was then incubated with T4 
DNA Polymerase to extend both the relay primer and the ter- 
minal primer. T4 DNA Polymerase was chosen because it does 
not exhibit strand-displacement activity 27 and so was expected 
to extend the 3 '-terminus of the terminal primer up to the 
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Figure 3. cDNA generation from chemically ligated tags using relay primer-mediated 
bypass on a photochemically ligated non-traversable template. (A) LC analysis showing 
approximately equal amounts of two primer-extension products after purification with 
detection at 495 nm. (B) Deconvoluted MS analysis of mixed primer-extension products 
showing two principal components with masses of 15,172 Da and 11,157 Da which corre- 
spond to full-length cDNA minus one dT, and primer extension up to the photochemical 
ligation junction respectively. 



5'-terminus of the relay primer without displacing it. Because 
the 5 '-terminus of the relay primer is 5'-monophosphorylated the 
junction between the extended relay primer and the extended 
terminal primer is a substrate for ligation with T4 DNA Ligase 
which was then added to the incubating mixture along with 
ATP. The steps in this process are outlined in Figure 1. Once the 
incubation was complete the hybridized complex was incubated 
with immobilized streptavidin to capture the chemically ligated 
conjugate and the cDNA was eluted with 0.1 M NaOH and 
neutralized. LCMS analysis is shown in Figure 3A and B with 
detection of the fluorescein at 495 nm indicating that approxi- 
mately 50% of the terminal primer had now been extended to 
generate a product with a mass close to that of full-length cDNA 
(observed deconvoluted MW 15 172 Da, calculated MW 15 459 
Da) with the observed mass suggesting the likely loss of one dT. 
The remaining approximately 50% corresponds to the exten- 
sion of the terminal primer up to the psoralen ligation junction 
(observed deconvoluted MW 11 157 Da, calculated MW 11 154 
Da). A control experiment in which the T4 DNA Ligase was 
omitted gave no full-length cDNA as expected. Sequencing 
indicated the cDNA had the expected sequence except for the 
absence of a 3 '-terminal dT along with a shorter cDNA that was 
terminated at the psoralen ligation junction. Of 34 full-length 



cDNA, all were missing a single dT from the 
3 '-terminal region with 33 of these contain- 
ing no more than one additional deletion or 
substitution. 

Generation of cDNA from a chemically 
ligated conjugate using repeat-dependent bypass 
Repeat-dependent bypass utilizes a "repeat" 
architecture in which the chemical ligation junc- 
tion is flanked upon either side by the same 
repeated sequence. In the presence of a hybridized 
upstream primer and a suitable polymerase the 
primer will be extended up to the chemical liga- 
tion junction and then stall having copied the first 
of the repeated sequences. If the rearrangement 
of this stalled complex results in hybridization of 
the 3 '-terminus of the newly synthesized cDNA 
to the second of the repeated sequences then tem- 
plate-dependent polymerization may continue. 
This process is shown in Figure 4. With multiple 
different repeats flanking multiple chemical liga- 
tion junctions multiple concatenated tags may be 
copied into a single cDNA. We have shown by 
experiment that this process readily occurs, does 
not lead to the shuffling of tag combinations in 
free solution, and we propose a mechanism by 
which it may occur. We refer to this method as 
repeat-dependent bypass and note it's similarity to 
the manner in which repeated sequences within 
genomic DNA can be contracted or expanded in 
processes that can lead to genomic instability 33,34 
and have been observed to occur in vitro to differ- 
ent extents with different polymerases. 35 

In order to demonstrate repeat-dependent 
bypass we successively used a Cu(I)-free triazole formation reac- 
tion followed by a Cu(I)-catalyzed triazole formation reaction 
thereby enabling the controlled sequential conjugation of a cen- 
tral oligonucleotide functionalized with both a 5'-alkynyl and a 
3'-azide first by reacting it at its 3'-terminus and then at its 5'-ter- 
minus as shown in Figure 5A. The chemical structure of each 
linkage is different and neither was expected to be traversable by 
a polymerase in the absence of a bypass mechanism because of the 
relatively large distance between the ligated bases and the fact that 
similar length triazole linkages have been shown not to function 
as templates for PCR amplification. 29 This was tested experimen- 
tally as described below. The Cu(I)-free ligation conversion was 
over 90% (Fig. 5B) and the Cu(I) -catalyzed ligation conversion 
was over 75% (Fig. 5C). After gel-purification and extraction we 
isolated 6 and 8 nmoles (40 and 53%) of each of two doubly con- 
jugated oligonucleotides — one constructed by conjugating (7) to 
(8) and (9) ("short-short") and the other by conjugating (7) to 
(10) and (11) ("long-long"). The final products were analyzed by 
LCMS with observed deconvoluted MW values of 38,572 and 
49,915 Da (calculated MW 38 560 and 49917 Da) as shown in 
Figure 5D and E. We screened a panel of DNA polymerases for 
their ability to read through the Cu (I) -catalyzed linkage formed 
by conjugating (7) and (9) including SuperScript III (Invitrogen), 
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Taq (NEB), Thermoscript (Invitrogen), 
KOD (Novagen), Vent (NEB), Vent 
(exo-) (NEB), Deep Vent (NEB) and 
Deep Vent (exo-) (NEB). We observed 
that PCR products were only generated 
with Vent (exo-) and Deep Vent (exo- 
). Similar constructs without a repeated 
sequence flanking the chemical ligation 
junction gave no PCR products under the 
same conditions. PCR amplification was 
conducted for each conjugate separately 
and for an equimolar mixture of the two 
conjugates using Deep Vent (exo-) DNA 
Polymerase. Analysis of the PCR products 
by agarose gel electrophoresis showed that 
the amplification of each of the isolated 
conjugates generates double-stranded PCR 
products with electrophoretic mobilities 
that correspond in each case to the loss of 
one of each of the repeated sequences in 
each of the pairs of ligation junctions — as 
shown in lanes 3 and 4 of Figure 6. The 
electrophoretic mobility values observed 
are consistent with the operation of the 
repeat-dependent bypass mechanism 
shown in Figure 5. The short-short con- 
jugate is 120 nucleotides in length with 
an expected PCR amplification product 
length of 96 base-pairs and the long-long 
conjugate is 156 nucleotides in length 

with an expected PCR amplification product length of 132 base- 
pairs. The mixed conjugate sample was also analyzed by cloning 
and sequencing. Of 35 sequence reads, 25 were observed to be 
of the long-long combination and 10 of the short-short combi- 
nation with no chimeric sequences observed with all sequence 
reads showing only one of each of the repeated sequences at each 
junction. 

Discussion 

Relay primer-dependent bypass 

In order to demonstrate cDNA generation using relay primer- 
dependent bypass and a template interrupted by a non-poly- 
merase traversable chemical ligation junction we first developed a 
method to generate a psoralen-thymidine ligated oligonucleotide. 
In contrast to the more well-known use of psoralen-modified 
oligonucleotides to generate psoralen-thymidine cyclobutane 
interstrand cross-links between hybridized oligonucleotides (see 
for example refs. 36 and 37) this method generates psoralen- 
thymidine cyclobutane linkages between adjacent nucleotides on 
the same strand and appears to be novel. This linkage is formed 
by co-hybridizing a 3'-thymidine oligonucleotide adjacent to a 
5'-psoralen oligonucleotide to a complementary splint oligonucle- 
otide that had no thymidines in closer proximity to the psoralen 
than four nucleotides away in the 3'-direction or two nucleotides 
away in the 5 '-direction (closer proximity was not tested), and 
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Figure 4. Proposed mechanism for the generation of cDNA from chemically ligated tags using 
repeat-dependent bypass. A terminal primer is annealed to the 3'-terminus of a chemically 
ligated oligonucleotide with a repeated sequence either side of the chemical ligation junction. 
Upon extension of this primer with a polymerase the newly generated cDNA stalls at the chemical 
ligation junction having copied the first of the repeated sequences. Rearrangement occurs with 
loop formation and the slippage of the terminus of the cDNA to become hybridized to the sec- 
ond instance of the repeated sequence, whereupon extension resumes and generates a full-length 
cDNA except for the deletion of one of the repeated sequences. 



then irradiating the complex with UV light. We were able to gen- 
erate this conjugate with isolated yields of approximately 80%. 
Yields of this magnitude translate into overall yields of 50% for 
the successive ligation of two tags and of 40% for the successive 
ligation of three tags. For DNA-encoded chemical library appli- 
cations these numbers of encoded cycles are of most interest to 
the medicinal chemistry community as they are more likely to 
lead to Lipinski-compliant compounds. 38 For cDNA generation 
a 5'-biotinylated version of the acceptor oligonucleotide was used 
(4) and the product was then isolated using denaturing PAGE. 
Using this template we were able to demonstrate the formation 
of cDNA with a yield of 50% and with the expected sequence. 
These observations demonstrate our ability to photochemically 
ligate 5'-psoralen oligonucleotides to 3'-thymidine oligonucle- 
otides using a complementary oligonucleotide splint (writing) 
and to then generate cDNA that permits the determination of 
the sequences and associations of ligated tags (reading). 
Repeat-dependent bypass 

In order to demonstrate the generation of cDNA using 
repeat-dependent bypass using a conjugate interrupted by a pair 
of non-polymerase-traversable chemical ligation junctions we 
first developed a method that allows for the controlled genera- 
tion of two triazole chemical ligation junctions within a single 
conjugate using a copper-free triazole formation from an azide 
and a strained alkyne 39 followed by a Cu(I)-mediated triazole 
formation similar to other reported reactions. 40,41 This approach 
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Figure 5A and B. (A) Controlled concatenation of three oligonucleotides using successive Cu(l)-free and Cu(l)-catalyzed chemical ligation by triazole 
formation. First the central oligonucleotide (7) is chemically ligated to (8) using the copper-free reaction of the azide group in (7) with the strained alkyne 
in (8), thereby preserving the terminal alkyne in (7) which is subsequently reacted in a Cu(l)-catalyzed reaction with the azide in the subsequently added 
third oligonucleotide (9). (B) LC time-course of Cu(l)-free conjugation of (7) with (8) using absorbance at 260 nm. 



permits the specific conjugation of three oligonucleotides with- 
out the need to use any protecting groups, also in high yield. For 
cDNA generation an oligonucleotide conjugate was generated 
by the chemical ligation of (7), (8) and (9) ("short-short") and 
used as a template for PCR along with a similar second con- 
jugate generated by the chemical ligation of (7), (10) and (11) 
("long-long"). Each of these conjugates has two chemical liga- 
tion junctions, within each conjugate each junction is flanked by 
a different repeated sequence, but these repeated sequences are 
the same for each of the long-long and short-short conjugates. 
Both conjugates were used to demonstrate that conjugates of 
this general design can be used to program template-dependent 



polymerase-dependent amplifications such as PCR. Because 
the conjugates are of different lengths we were also able to use 
them to demonstrate whether information encoded in ligated 
tag set combinations would be preserved during repeat-depen- 
dent bypass amplification, or alternatively whether it would be 
lost via recombination-mediated processes similar to those seen 
during DNA shuffling. 42 We were able to show that amplifi- 
cation using each of these templates resulted in amplification 
products containing only one of each repeated sequence at each 
junction, and that mixed amplification products were not gen- 
erated when the templates were mixed prior to amplification. 
These data are consistent with our proposed loop-formation/ 
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Figure 5C-E. (C) LC time-course of Cu(l)-catalyzed conjugation of (9) with the conjugate of (7) and (8) using absorbance at 260 nm. (D) LC analysis of gel- 
purified doubly-conjugated products (7)+(8)+(9) and (7)+(1 0)+(1 1) using absorbance at 260 nm. (E) Deconvoluted MS of gel-purified doubly-conjugated 
products (7)+(8)+(9) "short-short" and (7)+(10)+(11) "long-long." 



slippage repeat-dependent bypass model (Fig. 4) and suggest 
that if recombination processes are occurring they are relatively 
infrequent. Accordingly sequence data derived from the amplifi- 
cation of chemically ligated conjugates using this cDNA-genera- 
tion process can be used to infer the identities and co-association 
statistics of ligated oligonucleotides present in the input conju- 
gates — as is required for the successful application of this strat- 
egy to the encoding of the chemical history of individual library 
members within DNA-encoded libraries. 



Material and Methods 

Synthesis of an oligonucleotide conjugate with one psoralen- 
thymidine photochemical ligation junction 

Oligonucleotides (1), (2) and (3) (shown in Scheme 1) were 
acquired from Integrated DNA Technologies. All three were 
mixed together in aqueous 500 mM sodium phosphate at pH 7.0 
with the psoralen oligonucleotide (1) at 1 mM and the other two 
oligonucleotides at 1.1 mM. The mixture was then briefly heated 
to 95 °C followed by slow cooling to room temperature to allow 
hybridization to occur. Ten microliters aliquots were then cooled 
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Figure 6. Demonstration of the fidelity of cDNA generation from chemically ligated tags using repeat-dependent bypass. A pair of doubly-conjugated 
oligonucleotides was prepared - one by the chemical ligation of two short oligonucleotides to a central oligonucleotide and the other by the chemi- 
cal ligation of two long oligonucleotides to the same central oligonucleotide. The corresponding junction in each conjugate was flanked by the same 
repeated sequence. cDNA generation and subsequent PCR and electrophoresis using ethidium-stained agarose indicates that each of the conjugates 
may be amplified using relay primer-mediated bypass and that amplification occurs without the occurrence of homology-mediated shuffling, as indi- 
cated by the presence of bands with mobilities corresponding to amplification products containing "long-long" and "short-short" tag combinations, 
and the absence of the chimeric amplification products "long-short" and "short-long." 



to 4 °C and irradiated in 1.5 ml polypropylene microcentrifuge 
tubes (Fisher Scientific, 02-682-550) with UV light at 365 nm 
using a UVL-21 compact UV lamp (UVP). Aliquots of 1 uL 
were taken over a time-course and were analyzed by denaturing 
PAGE. The gel was visualized and photographed using the UV 
shadowing of a fluorescent TLC plate. 

Generation of cDNA from an oligonucleotide conjugate 
with one psoralen-thymidine photochemical ligation junction 

A gel-purified conjugate of (1) and (4) was then incubated 
in a 100 ul volume with a 5'-monophosphorylated relay primer 
(5) and a 5'-FAM-labeled terminal primer (6) each at 10 uM 
in T4 DNA Ligase Buffer (NEB) with 1 mM of each dNTP 
and 30 units of T4 DNA Polymerase (NEB). Following incu- 
bation for 1 h at 37 °C the mixture was supplemented with 
an additional 0.5 mM ATP and 10 units of T4 DNA Ligase 
(NEB) were added and it was then incubated for an additional 
1 h at 37 °C. The reaction product was then purified by incu- 
bation with 200 u-1 of streptavidin-coated Dynabeads M280 
(Invitrogen), washed, eluted with 35 u.1 of 0.1 M NaOH fol- 
lowed by neutralization with 1M pH 7.0 tris(hydroxymethyl) 
aminomethane. The product was then analyzed by LCMS on 
a Thermo Scientific LCQ Fleet using an ACE 3 C18-300 (50 
x 2.1 mm) column and a 5 min gradient of 5—35% of buffer 
B using buffer A (1% hexafluoroisopropanol [HFIP], 0.1% 
di-isopropylethyl amine [DIEA], 10 uM EDTA in water) and 



buffer B (0.075% HFIP, 0.0375% DIEA, 10 (jlM EDTA, 65% 
acetonitrile/35% water). LC was monitored at 260 nm and 488 
or 495 nm. MS was detected in the negative mode, and mass 
peak deconvolution was performed using ProMass software. In 
order to determine the sequence of both cDNA regions that were 
generated as a result of primer extensions, the 3 '-terminus of the 
cDNA was first ligated to a new primer-binding oligonucleotide 
5'-monophospho-GCTGTGCAGG TAGAGTGC-FAM-3' 
using single-stranded enzymatic ligation with 1 unit/ul of T4 
RNA Ligase 1 (NEB,) in lx T4 RNA Ligase buffer (NEB), sup- 
plemented with 200 uM ATP (NEB), 1 mM hexamine cobalt 
chloride (Sigma-Aldrich) and 25% PEG8000 (NEB) for 16 h at 
room temperature. This newly ligated cDNA was then purified 
using denaturing PAGE with fluorescence to visualize the band 
of interest followed by excision, extraction and PCR amplifica- 
tion was then conducted using the ligated cDNA at 200 pM, 
with the forward primer 5'-GCACTCTACC TGCACAGC-3' 
and the reverse primer 5'-GGCAGTACGC AAGCTCG-3' 
each at 0.5 uM and Platinum Supermix PCR mix (Invitrogen). 
Sixteen thermal cycles were performed with denaturation at 94 
°C for 30 s, annealing at 56 °C for 30 s and extension at 72 °C 
for 60 s. Amplification products were visualized on ethidium 
bromide-stained 4% agarose E-Gels (Invitrogen) with illumi- 
nation at 305 nm. Subsequently 0.2 units/u,l of Klenow DNA 
Polymerase (NEB) were added to the PCR reaction followed by 
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°'' o-5'-GAGCGGATGCAAAAAAAAAGGCGAGCTTGCGTACTG-3' 



2 5'-AAAAACTCAGGT-3' 



3 5'-TCCGCTCGACCTGAG-3' 




'o- p; o-5'-AAAAACTCAGGT-3' 



5 °" x o-5'-TCCGCTCGACCTGAG-3' 




<A>-5'-AGTACGCAAGCTCGC-3' 



P.- P" 

V Px o-5'-GGAATGAACAGGGTAAGCTGGAGTGAAGGCGTTATG-3'-azido 



8 o 5 °" °-5 -G AAGGCGTTATGTCCGTACTCTTG CAATCG G GTG CTATG CTT-3 



5'-TGCGGTCTAACTGTCTAGTTCACCTTCTCCGGAATGAACAGG-3'-azido 




10 rf 



<A>-5' -GAAG G CGTTATGTG ATATCCGTGGTGTCGTG AGTTCCAATCG G GTG CTATGCTT-3' 



11 5'-TGCGGTCTAACTGTCTAGGCACTTGTTCGTTTGCCAGTGTGATGGACACCACTTGGAATGAACAGG-3'-azido 



Scheme 1. Sequences and chemical modifications of oligodeoxynucleotides used in this study. 



incubation for 15 min at 37 °C. PCR products were then cloned 
using a Zero Blunt TOPO cloning kit (Invitrogen) and these 
clones were Sanger sequenced (Genewiz). 

Synthesis of an oligonucleotide conjugate with two differ- 
ent triazole chemical ligation junctions 

Oligonucleotides (7), (8) and (9) were acquired from 
Integrated DNA Technologies (IA, USA). Oligonucleotides (7) 
and (8) were mixed together at 0.5 mM concentration in aque- 
ous solution with 0.2 M sodium phosphate buffer at pH 7.0. 
The reaction progress was monitored by LCMS. To this mix- 
ture was then added oligonucleotide (9) and a Cu (I) -catalyzed 
triazole formation reaction was conducted by further incuba- 
tion with two molar equivalents of Cu(II) acetate, four molar 
equivalents of sodium ascorbate and one molar equivalent of 
Tris-(BenzylTriazolylmethyl) Amine (TBTA) in a volume of 



50 ul with oligonucleotides at a concentration of 0.5 mM. This 
mixture was incubated overnight at room temperature followed 
by LCMS and purification using denaturing PAGE. 

Generation of cDNA from an oligonucleotide conjugate 
with two triazole chemical ligation junctions 

PCR amplifications were conducted with conjugates at 40 pM, 
with the forward primer 5'-TGCGGTCTAA CTGTCTA-3' 
and the reverse primer 5'- AAGCATAGCA CCCGATT-3' each 
at 0.5 uM using 0.1 units/ul of Deep Vent (exo-) Polymerase 
(NEB) in lx Thermopol Buffer (NEB) with dNTP at 0.5mM. 
Twenty-two thermal cycles were performed with denaturation at 
94 °C for 30 s, annealing at 52 °C for 30 s and extension at 72 
°C for 60 s. Amplification products were visualized on ethidium 
bromide-stained 4% agarose E-Gels (Invitrogen) with illumina- 
tion at 305 nm. PCR products were cloned using a Zero Blunt 
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TOPO cloning kit (Invitrogen) and these clones were Sanger 
sequenced (Genewiz). 

Conclusion 

There is significant and increasing interest in the use of the 
affinity-mediated selection of DNA-encoded libraries of small 
molecules to discover inhibitors of proteins of therapeutic inter- 
est. In order to construct such libraries efficiently it is necessary 
to utilize robust methods for the ligation of oligonucleotides that 
are in turn able to support the determination of oligonucleotide 
tag sequences and associations while not interfering with the 
wide range of chemical reactions that will be used in the syn- 
thesis of the encoded library members. In this study we have 
demonstrated three different chemical ligation strategies — the 
uncatalyzed cycloaddition of a strained alkyne and an azide, the 
Cu (I) -catalyzed cycloaddition of a terminal alkyne and an azide 
and the photochemical cycloaddition of thymidine and pso- 
ralen. Each of these chemical ligations may be used as part of an 
encoding strategy for the chemical synthesis of encoded librar- 
ies. We have also developed two independent methods that may 
be used to generate cDNA that incorporates sequence informa- 
tion derived from oligonucleotide tags concatenated using these 
chemical ligations. cDNA generated by these methods may be 
amplified, cloned and sequenced and used to infer identity and 
statistical information about the output populations of molecules 
emerging from affinity-mediated selection experiments - and 
thereby used to identify small molecules that may inhibit thera- 
peutic protein targets of interest. The presented methods for the 
generation of cDNA include two alternative strategies, neither of 
which depends upon the ability of a polymerase to translocate 
through the chemical ligation junction, and both of which may 
be considered universal. One method utilizes a 5'-phosphorylated 
relay primer that is hybridized across the ligation junction and 
extended by a polymerase downstream while an upstream primer 
is extended up to it to form a nick. Ligation then delivers a single 
contiguous cDNA that may then be amplified by PCR, cloned 
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